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Executive Summary 


The ABS is currently undertaking two projects using data from the 1998 National Crime and 
Safety Survey. These projects are being done in collaboration with the Australian Institute of 
Criminology and the ABS National Centre for Crime and Justice Statistics in Melbourne. 


The two projects currently under way are: 
@ deriving small area estimates of crime prevalence and incidence rates; and 
@ estimating the propensity to report crime. 


Direct survey estimation of variables at small area level yields unacceptably high standard 
errors due to the small sample sizes. Increasing the sample size can overcome this problem, 
but this is an expensive option. The small area estimates of crime project is an attempt to 
derive small area estimates of crime rates using a combination of the 1998 National Crime 
and Safety Survey and 1996 Census data. Results from this project will lead to a greater 
understanding of the crime in local areas by practitioners in the field. 


It is proposed that a Logit model is used to model the propensity to report and to test the 
significance of socio-economic factors that affect an individual's reporting decision. Results 
from this model will give practitioners in the field a better understanding of which groups 
have a higher or lower propensity to report crime. 


Analysis Branch is involving criminologists and practitioners in this work through the project 
board, and through a peer review group reviewing the methods. The project board has a 
Police representative and a criminologist from the Australian Iinstitute of Criminology on it. 
The peer review group consists of criminologists and statsiticians in the criminology field. 


Approach 


Modelling the Crime and Safety Survey 

A major challenge in terms of methodology is how to accommodate and adjust for the survey 
design which has a combination of stratification and clustering aspects. A number of 
techniques to take into account survey design have been considered. It is proposed that 
survey design is taken into account through a weighting system in the proposed estimation 
procedure (either using replicate weights or a weighted least squares technique). 


Small area estimation of crime rates 

Small area estimation applies to a range of techniques that allow the calculation of estimates 
for small areas from larger areas, using survey and other data. The techniques essentially 
"borrow strength" from related areas to find more accurate estimates for a given area. 


The small area technique proposed in this study is based on a regression approach. Under 
this method regression coefficients are estimated from the survey for larger areas. Then these 
regression coefficients are applied to census data from smaller areas to derive estimates for 
these smaller areas. 


Estimation of the propensity to report crime to police 

It is proposed that a Logit model is used to model the propensity to report crime. The model 
will be estimated using data from the 1998 National Crime and Safety Survey. The dependent 
(binary) variable is the decision to report the crime. Independent variables include age, sex, 
whether a weapon was used, injury and whether anything was stolen. For this model, use is 
made of unit record survey data. 


Data sources and variables 


The main source of data for the two projects is the 1998 National Crime and Safety Survey. 
This survey covered the major crimes of break and enter, attempted break and enter, motor 

vehicle theft, robbery, assault, and sexual assault. These crime types can be joined into two 
broad crime groups, being personal crime (assault, sexual assault and robbery) and Property 
Crime (break in, attempted break in, and motor vehicle theft). 


The dependent variables for the small area estimates model are the victimisation prevalence 
and incidence rates. Different models are estimated for each crime type, and broad crime 
group (Personal and Property crimes). 


The independent variables for the small area estimates model came from the 1996 Census. 
The Census has full coverage of Australia, and provides reliable population figures at a small 
area level. Some independent variables (for assault only) also came from the 1998 National 
Crime and Safety Survey. 


The dependent variable for the propensity to report model is the ‘reported to police’ variable 
from the 1998 National Crime and Safety Survey. The independent variables for this model 
are from the 1998 National Crime and Safety Survey. 


Robustness 


The following tests for robustness are proposed. For the small area estimates model these 
tests include: 


e Simple aggregation of the small area estimates of crime and comparison of these 
aggregates with reliable survey estimates; 


e Sensitivity analysis of the model to investigate how small area estimates vary as 
selected key independent variables change; and 


e Use of the coefficients from the 1998 National Crime and Safety Survey against the 
1991 Census data, and compare the rates estimated using these data with the 
victimisation rates in the 1993 National Crime and Safety Survey. 


For the propensity to report crime project, they are: 


e Sensitivity analysis of the model; and 


e@ Run the same model against the 1993 Crime and Safety Survey and compare the 
results. 


Results 


This paper is exploring methodology to be used in the study. While some draft results may be 
available by the meeting, we do not present any results int his paper. 


Proposed Discussion Points for MAC 
We would welcome MAC's comments on any aspect of this paper. Particularly valuable 
would be comments on: 
(i) the methodology, techniques and research strategies we have used; and 


(ii) what conceptual and statistical frameworks might we use for our longer term research on 
Crime. 


@ Weare particularly interested in views on the small area estimation technique used, since 
this is a generic method and could be used for other surveys. Have we adopted the best 
techniques for small area estimation? Have we got the trade off between simplicity and 
statistical rigour about right? 


e Is the method we have used to take survey design into account appropriate? What 
improvements can we make to the proposed method, and what are the costs (in terms of 
complexity) and benefits (in terms of better estimates) associated with the improvements? 


@ Have we adopted the right solution to multicollinearity? 


e Are our tests for robustness appropriate? Are there any other tests we could do? 
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1. Introduction 


1. This paper discusses the methods proposed for use in two projects currently under 
way: 


e@ deriving small area estimates of crime prevalence and incidence rates; and 

@ estimating the propensity to report crime. 
2: These projects use the 1998 National Crime and Safety Survey (NCSS) to model 
crime rates and the propensity to report crime. They were started in December 2000, and the 


planned completion date is late 2001. 


The aim of this paper is to: 


© describe the methods we are using to estimate the models; and 


@ discuss the issues we face with the estimation. 


a The paper has four sections. Section 2 discusses aspects of methodology which apply 
both to the small area estimation project and the propensity to report crime to police project. 
Section 3 discusses methods which are specific to small area estimation, while section 4 deals 
with methods specific to the estimation of the propensity to report. Section 5 makes some 
concluding remarks. 


2. The 1998 National Crime and Safety Survey 

2.1.‘ Description of the Survey 

4, This survey was conducted in April 1998, with the reference period April 1997 to 
April 1998. The survey covered the more serious crimes that affect the largest number of 


people, being: 


e@ break-in - an incident where the respondents home had been broken into. The home 
included their garage or shed, but excluded their car and garden. 


@ attempted break-in - an incident where there were signs that an attempt was made to 
break into the respondents home. The home included their garage or shed, but 
excluded their car and garden. 


e motor vehicle theft - an incident where a motor vehicle was stolen from any member 
of the household. 


@ assault - an incident other than a robbery involving the use, attempted use, or threat of 
force or violence against the victim. 


@ sexual assault - an incident which was of a sexual nature involving physical contact, 
including rape, indecent assault, and assault with intent to sexually assault. Sexual 
harassment was excluded. Only females aged 18 years and over were asked sexual 
assault questions. 


@ robbery - an incident where someone had stolen (or tried to steal) property from a 
respondent by physically attacking them or threatening them with violence. 


Ds The 1998 National Crime and Safety Survey was a supplement to the ABS April 
Monthly Population Survey. Information was sought from 51,800 persons, of whom about 
42,200 (81.4%) responded. Data pertaining to households was sought from 25,600 
households, of which about 20,900 (81.6%) responded. The respondents were all persons 
aged 15 years and over who were the usual residents of private dwellings. Residents of 
non-private dwellings (hospitals, motels and prisons) were excluded from the sample, as were 
those living in remote or sparsely settled areas. 


6. The sample size meant that reliable estimates can be achieved for some statistical 
regions (Tables 9.1 and 9.2 in ABS: 1999b show estimates for Statistical Regions), but 
geographic areas below this suffer from small sample size. Therefore, small area estimation 
techniques are needed to calculate estimates for small areas. 


2.2 Sample Design 


‘De The NCSS uses a complicated sampling design, with clustering and different strata, to 
reduce the cost of running the survey. The effect of the clustering is that households are 
chosen in groups. The 1998 National Crime and Safety Survey sample is clustered for the 
practical reason of reducing the costs associated with forming a list of dwellings and of 
interviewer travel between selected dwellings. The more highly clustered the sample, the 
cheaper it is to enumerate, but the less reliable the estimates derived from the sample since 
the sample is confined to a small area and would be unrepresentative of the whole area. 


8. Stratification improves the accuracy of sample estimates by ensuring that different 
groups in the population are correctly represented. It also makes estimation of results for 
various geographic areas both easier and more accurate. 


Q: However, stratification means not all observations have the same probability of being 
chosen, so OLS regression, which assumes random sampling, may not lead to Best Linear 
Unbiased (BLU) estimates. 


10. Options for taking into account survey design are: 


e Ignore the effect of clustering, and take into account stratification by including the 
survey weights in the model. These are available on the unit record file. These can be 
included in the model using a SAS WEIGHT statement (with the weights adjusted for 
survey size), or the variables that the weighting depended on can be included in the 
model. 


e Estimate the model taking into account the sample design (both clustering and 
stratification). This can be done through the SAS SURVEYREG procedure, the 
WESVAR program, or SAS with a program to calculate variances from replicate 
weights. WESVAR uses replicate weights to adjust the variance for design effects, 
and this can also be done with a SAS program. SAS PROC SURVEYREG estimates 
a variance-covariance matrix using a Taylor series expansion. 


11. The first option is fairly easy to implement, and we will run some models of this type 
to compare to models that take clustering into account. 


12. For the second option, to calculate the coefficients, both WESVAR and SAS 
SURVEYREG use weighted least squares. Estimates for the regression coefficients are 
calculated as: 


B=(X'WX)'(X'WY) 


where W is a diagonal matrix constructed from the survey weights (and replicate weights for 
WESVAR), X is the matrix of independent variables, and Y is the matrix of dependent 
variables. WESVAR then uses 


X'WX for the full sample and each replicate, and solves the equations using a SWEEP 
algorithm. 


13: WESVAR uses replicate weights to estimate the variances of the estimates. This 
procedure uses 30 different samples from the data to estimate 30 different weights. Analyses 
is done separately using each of the replicate weights, and again using the overall weight, and 
the differences between the outcomes can be used to measure the standard error. The 
variance using WESVAR is estimated as: 


G 


Vz =CY By — Oy bY 


k=1 


where G is the number of replicates, and c is a weight depending on which method for 
calculating the replicate weights has been used. 


14. To estimate the variance, SAS PROC SURVEYREG uses a Taylor series expansion 
method to estimate the covariance-variance matrix, and therefore does not need the replicate 
weights. More on the Taylor Expansion can be found in Appendix 1. 


15. PROC SURVEYREG does need the stratum sampling rates or a dataset with the 
stratification variables and the population totals to estimate the variances. These are not 
available for the small area model. They are available for the propensity to report model, but 
PROC SURVEYREG does not allow estimation of logistic regression models. This means 
PROC SURVEYREG cannot be used for either of our models. 


16. SAS can be used to calculate the regression coefficients using WLS, with the 
estimates being increased to population estimates using the survey weights, and the weights 
for the WLS being I/SE. The variances can then be estimated in SAS using the replicate 
weights. The variance would be calculated as: 


NumRe ps 
eS (Re plicate, — Est)” } (Numreps —1)/ NumRe ps 


n=l 
This means all estimation is in SAS, rather than using another program (WESVAR). 


lige We would like to estimate the coefficients and variances using a number of different 
methods, to give us some idea of the effect each method has on the final results. The 
methods proposed for each model are described in the sections on estimating the models. 


Key Issue - Survey design 


Analysis Branch is likely to confront the issue of how best to take into account survey 
design in other projects which record unit survey data. 


A. In what ways can complex survey design affect modelling and other analyses? 


B. Is the effect of sample design on econometric models significant? 

C. Have we chosen the best methods to take it into account? 

D. Has there been any work done on comparing models which take the different varieties 
of survey design into account and models that don't? 


2.3 "Not stated's 
18. The other issue common to both projects relates to the 'not stated's on the file. These 
are items on the survey form where no answers were provided by respondents. There is no 
knowledge as to what the actual values could be. They only affect some (not all) variables 
on the file, and most of the variables affected will not be used in the proposed models. The 
highest proportion was for the "Whether security measures added in the last 12 months" 
question, where the not stated response was 3.4% of the total response. For the rest, the 'not 
stated's constitute between 0.5% and 1% of the total response. 
19. For a regression model, there are four ways to treat ‘not stated's: 

e@ Leave them in as a separate category and in the totals; 

e@ Take them out, reducing the totals; 


e Pro rata them out to the other categories, or separately within imputation groups; or 


@ Model the Stateds and 'not stated's separately. 


20. Appendix 2 shows a list of variables affected by a 'not stated’ classification; and the 
extent of the 'not stated's. Because (a) the effect of the 'not stated's is not likely to be large 
because they form a small proportion of total response, and (b) the 'not stated's only exist for 
variables that may not be used in the modelling, it is proposed that they are left in the data file 
as a Separate category, and no attempt is made to pro-rata or remove them. Any totals are left 
with the 'not stated's in. Basing the model on proportions calculated using a total with ‘not 
stated's out is the same as pro-rataing the ‘not stated's, which assumes a distribution for the 
‘not stated's that may not be correct. 


Key Issue - How to deal with 'not stated's 


Have we used the most appropriate way of dealing with the 'not stated's? 


3. Small Area Estimates of victimisation 


21. The aim of this project is to derive estimates for crime rates for areas smaller than can 
be supported by the 1998 NCSS. 


3.1 Small area estimation techniques 


2d: There are a number of techniques under the broad heading small area estimation. 
These include synthetic estimation, simple empirical and hierarchical Bayes; and Empirical 
Best Linear Unbiased Prediction (see Ghosh, M & Rao, J: 1994 for a summary of these 
techniques). 


23: We are using a a synthetic estimation technique, because the broad technique 
(regression modelling) is well known, and is understood in the criminological field. Broadly 
synthetic estimation calculates relationships from aggregate survey data and applies them to 
small area Census data. The relationships can be simple ratios, or regression coefficients. 
Examples of synthetic estimation are in Purcell, N & Linacre, S: 1976; Ghosh, M & Rao, J: 
1994; and Stasny, E, Goel, P & Rumsey, D: 1991. 


24. The disadvantage with synthetic estimation is that it assumes that the victimisation 
rates for the chosen socio-demographic groups from the survey data are the same as the 
victimisation rate for that group at the local level. The regression modelling, by plotting a 
line through the independent variable observations, is averaging the independent variables 
across these regions. The slope of these lines (the coefficients) are then being used to predict 
the smaller areas. The estimates for the smaller areas are likely to break down if the small 
area does not have similar coefficients for the independent variables to the larger area. This 
could happen if there is some other characteristic of the small area which the model does not 
take into account. For instance, the synthetic estimate may be a 10% Motor Vehicle Theft 
rate on Cocos Island, given the socio-demographic composition of the population. However, 
the true rate is 0%, because there is no market for stolen cars; and there are very few cars on 
the Island. This could be taken into account by including the number of motor vehicles in the 
model. 


Key Issue - Choice of small area estimation technique 


Is synthetic estimation using a regression model the most appropriate small area estimation 
technique given the data available and the target audience for the small area estimates? 


3.2 Description of the approach to the estimation of small area crime victimisation rates 
25. The approach adopted for this study has six main steps: 

Step 1 - Choice of levels of geography to use in the study 

Step 2 - Choice and construction of measures of victimisation 

Step 3 - Merge estimates of crime victimisation from NCSS 1998 with Census 1996 


Step 4 - Estimate the regression equation (at ABS Statistical Subdivision) using NCSS 1998 
and Census 1996 


Step 5 - Apply the model estimated in Step 4 to derive small area estimates of crime 
prevalence and incidence rates 


Step 6 - Validate the small area estimates 
The rest of the section discusses each one of these steps in turn. 


Step 1 - Choice of levels of geography to use in the study 


26. Figure 1 shows the Australian Standard Geographical Classification Areas (ASGC). 
The geography we are using from the NCSS is the statistical region. Estimates of crime by 
statistical region are provided in ABS: 1999b. The Census data allows one to apply the 
models at the Census Collection District (CD) geographic area. However, this level of 
geography will provide unreliable estimates because local effects are not being taken into 
account in the model. In this study: 


e the regression equation is estimated at the Statistical Region; but 
e@ the small area estimates are derived at Statistical Local Area level. 


27. Both these classifications cover the whole of Australia. 


28. The statistical region level was chosen for the estimation of the model because most 
of the estimates at this level are reliable. Those which are not will be taken into account in 
the implicit averaging done by the weighted least squares regression model. Because of the 
stratification, weigthted least squares is already being used in the estimation of the model. 
To get the best estimates taking into account the standard errors, the weight should be 1/SE, 
so those areas with a high SE get a lower weight. The stratification uses the survey weight 
for the Weighted Leat Squares estimation. There is some similarity between these two, and 
we think they are similar enough to use the survey weight as a weight to take into account 
the standard errors of some areas. A graph of the survey weight and 1/SE is shown in 
Appendix 3. 


29. The Statistical Local Area geography is small enough to cover suburbs in urban areas, 
but is not so small that the estimates of crime are localised. It is a geography that 


practitioners in the peer review group suggested would be useful to have estimates for. 


Figure 1: 
Australian Standard Geographical Classification Areas (ASGC) 
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Step 2 - Choice and construction of measures of victimisation 


30. Let H. = the total number of households in region 1; 


P =the total number of persons in region i who are at least 15 years old; 
6 


nj = Lif household h inregion i is (for atleast | time) avictimof crime j, 
= 0, otherwise. 


Jj. ={Break in, Attempted break in, Motor vehicle theft} 


Vin =1if person p inregion i is( for atleast | time) avictimof crime k, 
=0, otherwise. 


k  ={Robbery, Assault, Sexual assault} 


n, 1s the number of times crime | (which could be household or personal) has occurred to 
victim r 


Sk. In the following equations, prevalence is the number of the relevant population that 
have been a victim of a given offence at least once in the reference period. Incidence is the 
total number of incidents of the offence that occurred in the reference period. As some 
victims experience repeated incidents of victimisation, incidence numbers are typically higher 
than prevalence numbers. 


Household crime prevalence rates 


(hy 
32, The prevalence rate for household crime 1 is denoted by “i; where 


(h) 
7 ={ the number of households which (over the last 12 months, for at least one time) have 


been victims of household crime j in region 1)/(the total number of households in 
region i}* 100; 


Aj 
=(>,4,/H,)*100 2 (la) 
h=l 


Personal crime prevalence rates 
(p) 
a2: The prevalence rate for personal crime k is denoted by Tix where 


(Pp) 
Tit ={ the number of persons who (over the last 12 months, for at least one time) have 


been victims to personal crime k in region 1)/(the total number of persons in region 1}* 
100; 


FB 
=()°V 4 / P;)*100 ------ (1b) 


p=! 


Household crime incidence rates 


(h) 
34. The incidence rate for household crime i is denoted by Ay where 


(hy 
Ai ={ the total number of households which (over the last 12 months) have been victims to 


crime j in region 1)/(the total number of households in region 1}* 100; 


H, 
=() 5,3 *n, /H,)*100 ------ (2a) 


h=l 


Personal crime incidence rates 


() 
33. The incidence rate for personal crime k is denoted by ae where 


(p) 
Rit ={ the total number of persons who (over the last 12 months) have been victims to 


personal crime k in region 1)/(the total number of persons in region i}* 100; 


=(SV ty, /B)*100 ee (2b) 
36. Equations (1a), (1b), (2a) and (2b) express the victimisation rates as percentages. 


These rates are derived from NCSS 1998. Percentages are used because they are easier to 
model than numbers of crime, which, because they are count data, follow a Poisson 
distribution rather than a normal distribution, so OLS regression is not appropriate. 


Step 3 - Merge estimates of crime victimisation from NCSS 1998 with Census 1996 


37. In this step, for each crime j, the study proposes to create matrices of the following 
form: 


Nag Lie Log ap Lag SY ce Va KH 


Ti, 4; 


Tse Wig? Zi, Zp ces Dag VN eae Meas 


T2? 


Re ene Aa AS RS (nO, Gene Se 


jd? 


38. The data matrix in equation (3) could be for either a personal crime or a property 
crime. The matrix comprises four main parts. 


Estimates of crime rates 


39. The data in first two columns would be generated from step 2 above and gives 
estimates of crime rates from NCSS 1998 at the Statistical Division level. These variables are 
taken from the 1998 National Crime and Safety Survey. The base is therefore persons over 
the age of 15 living in private dwellings. The denominator for the rates is the relevant 
population for the survey (persons aged 15 and over), as described in Section 2 above. 


Socio-demographic characteristics of residents from Census 1996 


AO. The Y matrix in (3) introduces the person-related attributes corresponding to the 
different regions as extracted from 1996 census. These attributes include income, 
unemployment, aboriginality, etc. The regression model is measuring the association between 
this population and victimisation. It does not follow that a significant association indicates 
that group is more likely to be victims of crime; they may be the offenders; or they may just 
always live in certain high or low crime areas. 


Al. Appendix 4 shows the list of variables currently considered for inclusion in the Y 
matrix in (3). All variables are expressed as the proportion of people in the sub-group, except 
income. This is expressed in three ways: 


1. Median Income; 
. Lowest quartile of income, for a measure of poverty; and 
3. Skewness, as a measure of the relative differences in income. The skewness measure 
2 
— Ns xi XN 
(N -1)/(N -2) 
Skewness = 5 
used is - . This is a standard measure given by 


the program we are using to extract the Census data. 


Geographic Region Characteristics from Census 1996 


42. The data in the columns of the Z matrix would be derived from 1996 Census and 
relate to variables associated with the different regions. They measure region characteristics 
like population density, dwelling tenure, and dwelling types. A full description of the 
variables is given in Appendix 4. Again, the denominator for any rates is the population aged 
15 and over. The reason for this is that the NCSS does not include victims under 15 years; 
and offenders aged under 15 in most of the categories collected by the Crime and Safety 
Survey are rare. 


Socio-demographic characteristics of victims from NCSS 1998 


43. It could be theorised that victim characteristics are an important determinant of 
victimisation for some crimes. For example, in the case of assault, it may be that higher 
victimisation is associated with certain characteristics of the victim. In (3) this data set is 
represented by the matrix X. 


44. All victim characteristics are expressed as rates, with the denominator for the 
victimisation data being the survey weighted population, which was persons who were at least 
15 years of age living in private dwellings in Australia in April 1998. 


45. The victimisation characteristics are from the National Crime and Safety Survey. 
Including victimisation characteristics will mean having to extract crosstabs from the NCSS, 
which will affect the reliability of the survey estimates. Thus a model that uses the X matrix 
is likely to be less reliable than a model using the socio-demographic characteristics Y which 
are extracted from Census 1996. Because of this, the first model only includes the Y 
characteristics, with the X characteristics will be added in later versions of the model. A full 
description of the variables is given in Appendix 4. 


Step 4 - Estimate the regression equation (at ABS Statistical Subdivision) using NCSS 


1998 and Census 1996 


46. In this step the study estimates 16 regression equations using data from NCSS at the 
Statistical Subdivision level and the corresponding 1996 Census data for selected variables. 
The 16 regression equations estimated are: 


(i) All household crime - prevalence rate 

(ii) All household crime - incidence rate 

(ii1) Household crime - Break in - prevalence rate 

(iv) Household crime - Break in - incidence rate 

(v) Household crime - Attempted Break in - prevalence rate 
(vi) Household crime - Attempted Break in - incidence rate 
(vit) Household crime - Motor vehicle theft - prevalence rate 
(viii) Household crime - Motor vehicle theft - incidence rate 


(ix) All personal crime - prevalence rate 

(x) All personal crime - incidence rate 

(xi) Personal crime - Robbery - prevalence rate 

(xii) Personal crime - Robbery - incidence rate 

(xiii) Personal crime - Assault- prevalence rate 

(xiv) Personal crime - Assault- incidence rate 

(xv) Personal crime - Sexual Assault- prevalence rate 
(xvi) Personal crime - Sexual Assault- incidence rate 


47. The estimated equations are of the following form: 


Household crime j prevalence rates 


Re Sad, Ltd gp Ze Se Fd gp 2 He, + ORY, +k Oar Yet fh uky ef Bky tf a Xp t ey, =H SSS (4a) 
Household crime j incidence rates 
AD = a,+d,, Z,+d),Z, + .. +dZ,+ uy, +ewY, +..tew Yt fuX, + fio X, +..+ fle X,+8,----- (4b) 
Personal crime k prevalence rates 

(s) = 
i. Sy PZ, Bi Ag Pa Bye 2 Pg i lo hae Fg MO My: HO Ke Pee RO Ap ey = (4c) 
Household crime j incidence rates 
ty =0,+ B.Z,+ BZ, +... +B Zp+ 7 LY +7 Yt + 7, Ye tOuX, +X, +..40 Xp +e, ----- (4d) 


48. This regression model will be estimated with both SAS and WESVAR, so the results 
can be compared. Unfortunately, the model cannot be estimated using PROC SURVEYREG 
in SAS because the stratification variables are not available in the aggregated dataset (the 

aggregation is larger than the stratification variables). The models we intend to estimate are: 


1. Base model with no adjustment for stratification or clustering. The NCSS data will be 
weighted before estimating the model. 

2. Base model estimated using WLS in SAS, with the weight being the adjusted survey 
weight. 

3. Model estimated in WESVAR using the replicate weights. 

4. Model estimated in SAS using WLS, with the weight being 1/SE (calculated from the 
replicate weights) and the survey estimates increased to populaiton estimates using the 
survey weights. 


Multicollinearity 


49. A key issue affecting the analysis under this step is multicollinearity. Because of the 
nature of the independent variables used in this model, there may be some multicollinearity. 
In particular, there are three measures for the three different types of income. The effect of 
multicollinearity is that the variance of the regression estimators may be over-estimated, and 
the regression coefficients may be very sensitive to small data changes (Johnston, J: 1991, p. 
248). 


50. This study proposes to test for multicollinearity in the following ways: 


e@ look at a simple correlation matrix to identify variables that are collinear; and 


@ use will be made of the Variance Inflation Factor which measures the extent 
multicollinearity inflates the variance of the regression estimators. The VIF shows how 
much higher the variance of the estimator is compared to what the variance would have 
been if the variable being tested was not collinear with any other variables in the 
regression. It is calculated as 1/(1-r), where r is the correlation coefficient of the variable 
in question regressed against all other independent variables. Variables with a VIF of 
more than 5 (ie, r= 0.8) need to be carefully reconsidered. 


SL: For an estimation model, the effect of multicollinearity on the estimation side is not 
likely to be great. Because of the effect on the variances, it is not possible to rely on the t 
statistics for significance tests. It is also makes problematic sensitivity analysis of the models 
(because of the effect of multicollinearity on the standard errors and the regression 
coefficients). 


D2. To drop the variables causing the multicollinearity may mean some important 
explanatory variables (from a criminology theory perspective) may need to be removed. We 
will need to be careful when removing variables to ensure that there is still some proxy for 
the underlying theory in the model. 


53: This study proposes to minimise the amount of multicollinearity in the model by 
removing variables causing some multicollinearity. However the final decision whether to 
exclude or include a variable will take into account (a) the VIF, (b) the variable's 
explanatory power in the model, and (c) the variable's theoretical significance. In particular, 
we expect to remove some of the measures of income from the model. We also expect some 
collinearity between qualifications and income. 


Key Issue - Multicollinearity 


Is the approach we have taken to the issue of multicollinearity appropriate? 


Step 5 - Apply the model in Step 4 to derive small area estimates of crime prevalence 
and incidence rates 


54. This step takes the coefficients from the estimated model in step 4 and Census data at 
a small geographic area to calculate small area victimisation estimates. The constant, which 
applies to the larger area, will be pro-rated to the smaller areas based on the estimates of 
crime without the constant. So the overall procedure will involve two steps. Equations (5a) - 
(5c) show the two steps in the case of household crime J. 


Household crime j prevalence rates in small area 'r' 
Ae a, =0 in Equation(4)} -----— (5a) 


nm‘ =Constant (r) + {7° 


(a \@e=0f Hag 2rd, Zt 4 hg Bete ta tego t fie tet ike 2 (5b) 


Constant (r) = a, Sa (a, =0))*(H, 1S H,) Bos Ge) 


"R'is the total number of regions fromthe NCSS survey. 


55. Equation (5a) is implemented first for all small areas R in Australia. Equation (5c) is 
then used to estimate a constant for small area 'r'’. 


56. Because the regression model coefficients provide estimates of the actual crime rate in 
the small area, the small area estimates will not add to the survey data. At this point, an 
adjustment could be made to the small area estimates to force the sum of the smaller areas to 
equal the larger area survey estimate. Equation (5c) is an example of such an adjustment. 
Other adjustments that could be made are shown in Stasny, E, Prem, G and Rumsey, D: 1991. 
These include forcing the sum of small areas to equal the large area, but also minimizing the 
sum of squared differences between the original and revised small area estimates; or 
minimizing the squared relative differences between the original and revised small area 
estimates. Stasny, E, Prem, G and Rumsey, D: 1991 found that a constant scaling factor 
provided the best results. 


D1: In the peer review group , there was a split between the methodologists, who didn't 
want this adjustment made; and the policy analysts and advisers, who did want it made to 
make it easier to explain the estimates and justify them to critical audiences including 
Ministers, newspapers, and similar groups. 


58. The other option that has been considered is to include into the estimation model the 
restriction that the sum of the small areas must equal the larger area totals. This would be 
similar to an income constraint commonly introduced in the estimation of systems of 
consumer demand equations (ie a consumer's expenditure must not exceed his or her budget). 
At this stage it is not clear how such an option can be implemented in the case of small area 
estimates of crime. It is harder to implement, short of estimating using a Bayesian method 
and using this restriction as a prior. 


59. This relationship can also be used to validate the model, before it is used to adjust the 
model. 
60. This method of deriving small area estimates is called synthetic estimation. We are 


also considering extending this to a composite synthetic approach. This method uses the 
synthetic estimates with direct estimates, giving more weight to the direct estimates when 
they are more reliable. The new estimate is calculated as: 


ye =wyY,+d-w,)Y,,; 


where * is a direct estimator, ” is a synthetic estimate and " is a weight, which can be 
estimated in a number of ways, but usually use the standard errors in some way, so that direct 
estimates with a higher standard error get less weight. Ghosh, M & Rao, J (1994) have a 
discussion of the choice of weights for this model. 


Key Issue - Introducing restrictions into the model 


Is there an easy way to include a restriction that the sum of the smaller areas equals the 
larger survey area in the WLS model we are using? 


Step 6 - Validate the small area estimates 


61. Once estimates for small areas are derived three broad strategies are used to check or 
validate their accuracy: 


i) Internal consistency checks are undertaken 


62. One internal consistency check for the model is to use the survey estimates as a check 
of the model. In theory, the small area estimates should sum to the survey data. In practice, 
the OLS method is an approximation, so the small areas estimates of victimisation will not 
necessarily sum to the survey weighted estimates. The difference will give us some idea of 
how good the small area estimates are. 


63. If the small area totals differ from the survey totals (see Step 5 above), the survey 
estimates can be used to scale the small area estimates so that they add up to the survey 
estimate. The problem with this is that the model is then being 'forced' to fit the survey data. 
The estimates are no longer efficient, and the variance of the estimates increases. 


i1) Test to see how sensitive the results are to a change in the inputs; 


64. This method tests the sensitivity of the model to changes in the data. If the change to 
the results is large, then the model is sensitive to any changes to the inputs, and may give 
widely varying results depending on the inputs. This would signal that care should be used 
when reporting the results of the model. To use this method, one would need to have 
adjusted for survey design in the calculation of the model, since the standard errors are used 
for the sensitivity analysis. These will be incorrect if the survey design has not been taken 
into account in the modelling. 


1ii) Compare the results to estimates derived from another method, source of data, or model. 


65. This method tests the model against some external benchmark. For instance, the 
estimates from the model could be run against the 1991 Census and the results could be 
compared to the 1993 Crime and Safety Survey data (which was equivalent to the 1998 
Survey for some crimes). If the model estimates are significantly different from the survey 
estimates, and there is no explanation for the difference, then the model needs to be 
questioned. The problem with this approach is the reporting rate for the Indigenous indicator 
on the Census seems to have changed between the 1991 and 1996 Census. Between the 1991 
and 1996 Census, there was a 33% increase in indigenous persons. The ABS has calculated 
that 1/2 of this can be accounted for by births and deaths. The other half is an increased 
propensity by parents to identify children as indigenous. This means that if indigenous status 
is used as an estimator for small area crime rates, the coefficient applied in 1996 would be 
different to that applied in 1991. 


66. Another external benchmark is the New South Wales series of Crime Victimisation 
Surveys. A model could be run against these surveys, and then small area estimates could be 
derived for the 10 year series. To do this, one would also need the independent variables 
available on an annual basis. Much of the data may be available annually from other sources, 
although probably not for small areas. Therefore, this validation technique may be difficult. 


67. Again, any large and unexplained movements over the 10 years would suggest a 
problem with the model. 


68. Victoria also conducts a crime victimisation survey, the last one being in 1999. They 
have published results at a Local Government Area (LGA) level, so one could use the 
proposed model to estimate victimisation rates by LGA in Victoria, and compare these with 
the published Victorian estimates from their survey. 


69. A further source of data could be Police estimates of crimes reported at a small area 
level. This method would test both the small area estimates model and the reporting model. 


IKey Issue - Validating the model 


Have we covered all the options for validating the model? Is there anything else we could 


3.3 Preliminary Results 


70. Some results may be available before the June meeting, and if anything useful is 
available, we will send copies to MAC members. 


3.4. Other relevant issues excluded from the study due to unavailability of data 
3.4.1 Police as an independent variable 
71. The number of police in an area is likely to have an effect on victimisation rates. 


However the magnitude and direction of this effect is not agreed by experts (see Marvell & 
Moody: 1996) 


422 Unfortunately, for small areas, police presence is very difficult to quantify. States 
may have police that regularly move between areas. The only information readily available is 
police expenditure and this is only available at a State level (SCRCSSP: 2000). Even if the 
data were available obtaining it would be difficult due to its sensitive nature (ABS: 2000). 
Therefore the proposal is that police is not included as an independent variable in the model. 


3.4.2 Drug related offences 


43. Many break-ins may be drug related, so areas where there are more drug-dependents 
may have higher break and enter victimisation rates. 


74. The NCSS is a 'victimisation' survey - The fundamental question being asked is ‘have 
you been a victim of crime?'. To study the issue of whether break and enter victimisations are 
drug related, offender data, and in particular, motivation for the crime, would be required. 
These data are unavailable in the NCSS and data from Justice System may be difficult to 
obtain and may not be appropriate for use in this study (ABS: 2000). 


1D: The NCSS has a neighbourhood perception question: 


"What are the problems from crime or people creating a public nuisance in your 
neighbourhood?’ 


70. This question was asked of all respondents and one possible response specified illegal 
drugs, and another was break-ins. Because break-ins may be associated with illegal drugs, it 
is difficult to associate these two perceptions from the survey. Further, the respondent could 
choose as many problems as they perceived as being a nuisance. This makes it difficult to 
interpret the priority of the ‘perceptions’. The question is also a subjective question - what one 
person interprets as ‘drug problems' may be different from what another perceives as drug 
problems. 


77. Illegal drugs are not perceived as a major problem, ranking as sixth most perceived 
neighbourhood problem. This places them almost equal with drunkenness and louts/gangs. 
Because of the problems associated with getting data on drug offenders, this study does not 
include drug use in the model. 


3.4.3, Offenders and victims travelling between areas 


78. If a person from a given area is victimised in another area (say, inner city), the NCSS 
enumerates the victimisation where the person resides. This means that in some cases the 
small area model which assumes that place of residence is the same as place of victimisation 
may be erroneous. 


79. For better results one would need data on where the victimisation took place and 
assign it to that given area. From the survey, there is no question that allows us to identify 
where the crime took place. It is also difficult to measure the extent of the problem of 
mismatch between the place of residence and the place of victimisation. 


80. The NCSS has a question relating to where the incident take place. Possible 
responses include At home, At place of work or study, In a place of entertainment, etc. 
Analysis of these responses reveals that 35% of all personal crime occurs at home. It can be 
assumed that for 35% of cases (total personal crime) this issue is not a problem. For the 
remaining 65% it may be. Unfortunately, without data on the location of the offence, there is 
nothing that can be done about this problem, except to recognise it and document it in our 
results. 


4. Propensity to report crime 


81. The aim of this project is to identify victim characteristics that affect the decision to 
report a crime. 


4.1 Method 


82. The decision to report crime is a binary response variable (0/1). It is therefore 
proposed that a logit regression model be used to model the relationship between the decision 
to report and other victim characteristics. This is similar to the method used in other studies 
of reporting behaviour (see Skogan, W: 1994; Kury, H, Teske, R & Wurger, M: 1999; and 
Carcach, C: 1997). A bivariate probit model has also been used in the literature (Macdonald, 
Z: 2001), which also models the Yes/No victimisation variable. The logit regression takes 
victimisation from the survey. 


83. The estimates would be calculated by crime type (personal and property); and by 
crime, if the data allows it. The dependent variable would be the propensity to report the 
latest victimisation. There is not enough information on victimisations before the latest one 
(any victimisation besides the latest has a smaller set of data attached to it in the survey), so 
initially only the latest victimisation will be considered. 


84. The SAS procedure SURVEYREG will not estimate logistic regressions. However, 
the program WESVAR does, and with replicate weights, we can calculate the standard errors 
for a WLS estimation. 

85. The models we intend to estimate are: 

1. Model with no adjustment, and no weighting on the unit record file; 


2. Model including the survey weight as a weight in WLS; 


3. Model including a survey weight adjusted to the survey population as the weight for 
WLS; 


Model including 1/SE (calculated in SAS using the replicate weights) as the weight for WLS, 
and the survey estimates increased to populaiton estimates using the survey weights. 


IKey Issue - Propensity to report method 


Is the logit regression model the best model for this problem? Are there any better models 
hat could be considered? 


4.2 Data 


86. Appendix 5 shows a ‘wish list' of data. Problems due to the quality of the data and the 
relative standard errors may mean that some of the data will be aggregated, or will be 
included in one model but not in another. 


87. Most of the independent variables are binary variables, with the exception being the 
number of previous victimisations. All the data come from the 1998 National Crime and 
Safety Survey. 


4.3 Preliminary Results 


88. Some results may be available before the June meeting, and if anything useful is 
available, we will send copies to MAC members. 


4.4 Validating the model 


89. Once estimates for the propensity to report for different groups are derived, how does 
one check, or validate, them? There are two broad methods: 


1) Test to see how sensitive the results are to a change in the inputs; and 
ii) Compare the results to estimates derived from another method, source of data, or model. 


90. The first method tests the sensitivity of the model to selected data changes. If the 
change to the results is large, then the model is sensitive to changes in the selected variables, 
and may give widely varying results depending on the inputs. Note that this validation 
method requires that the effect of the sample design has been taken into account in the 
modelling stage, since it uses the standard errors to calculate the sensitivity of the input 
variable. 


91. A test of the reporting behaviour could be to check the reporting propensities against 
Police estimates of crimes reported. However, there are differences between the NCSS 
reporting data and Police data. The reporting question on the 1998 National Crime and 
Safety Survey asked "Did you tell the Police about the most recent incident?". In some cases, 
someone else (friend or family) will report the offence to police. In other cases, the police 
will discover the offence themselves; or may not record the offence in their system because it 
is a minor offence. They also may classify the offence differently from what the victim 
classifies it as on the NCSS. Therefore, the reported number will not match Police reported 
statistics (see ABS : 1999a, p. 82 for a comparison of the NCSS and Police data). 


92. One could also run the model against the 1993 National Crime and Safety Survey for 
those crimes that did not have definitional changes between 1993 and 1998, and compare the 
results. 


IKey Issue - Validating the propensity to report model 
Are there any other ways the model could be validated? 
5. Conclusion 


93. The 1998 National Crime and Safety Survey has been used in two modelling 
exercises. The first was to calculate small area estimates of victimisation using victimisation 
data from the survey and socio-demographic data from the 1996 Census. Because the Census 
was conducted in August 1996; and the 1998 Survey asked about victimisation over the past 
year; the data are temporally fairly close. 


94. Applying regression analysis to this survey raised the issue of how to adjust for 
sample design. We propose to deal with this in a number of different ways, and compare the 
results. 


Oo; The small area estimation model has raised some interesting questions about the 
accuracy of survey estimates, and how they can be enhanced for small areas. 


96. The propensity to report crime model used only the survey data; and used a logit 
model to estimate reporting behaviour for different groups. 


ie While we have not presented any results or conclusions from our models in this paper, 
if reasonable results are available before the MAC meeting, we will certainly send these out 
to the members. 


Appendix 1 
Taylor Expansion Method 


The Taylor series expansion can be used to estimate the covariance-variance matrix of the 
estimated regression coefficients. For a full description of this method, see Fuller: 1975. 


For a stratified, clustered sample, observations are represented by a matrix 
(w, y, X)= (Wij nig Xniz) 


where w denotes the sampling weights, y denotes the dependent variable, X denotes the 
design matrix, h is the stratum number with a total of H strata, i is the cluster number with a 


total of ”* clusters, j is the unit number within cluster i of stratum h, with a total of ”” units. 


A 


The estimate of B is calculated as 
B =(X'‘WX)' xX wy 


Let 


The following vectors are calculated: 


Cn = Wai nig Xn 


and the p*p matrix 


H Nh 


Ga Ay OS 64, - 8. Cn -B.) 


Nn-Pia n,—-1 


where ‘i is the sampling rate for stratum h, and all the other variables are defined above. The 
covariance matrix of ? is then calculated as: 


V =(X WX) 'G(X Wxy? 


Appendix 2 
List of variables with a Not Stated classification 


ariable escription % Not 
Stated 
Ove I ype of dwelling (separate house, flat, etc) 0.99 
VISIBLE Peeling visibility (completely seen, partially screened, not 
isible, etc) 
LOCATED welling location (near a laneway, on a corner, in a dead end St., 
tc) 


ARTRAFF (CARTRAFF [Amount of vehicle traffic (constant, peak hour only, etc) 0.46 
CARTEATE Pedestrian traffic (constant, peak hours only, etc) oe 
[CLOSETO | [CLOSETO Live close to park, school, pub, shops, etc. | 340 | 40 


RESIDED ength of residence at current address (less than 1 yr., 1 to 3 yr., 0.36 
tc) 
‘ice Dwelling tenure (renting, own, buying, etc) 


IHAVEDOG | hether household has a lWhetherhouseholdhasadog i asst (aisststs—<isi‘“‘“‘<—<‘<;CStsts | 161 | | 161 | 
RIVEWA [Whether there is a car in the driveway (all the time, sometimes, 1.11 
‘aaa arely, etc) 

LIGHT Whether there is outside lighting (street light, sensor light, etc) there is outside Whether there is outside lighting (street light, sensor light, etc) (street light, sensor light, etc) | 1.00 | | 1.00 | 


WINDOW |Window security (bar, grills, etc) 1.71 
DEADLOK |Whether doors have deadlocks 1.70 
SCREEN Whether have security screen doors 1.19 
ALARM Whether have an alarm system Zoe 
pee Whether security measures have been added in the last 12 mths 3.40 


(CRMHRE | eason not Reason not told the police (break andenter) = i stst—S the Reason not told the police (break andenter) = i stst—S (break and enter) | 131 | | 131 | 


CTIONB [Action of break and enter offender (stole property, damaged 0.01 
roperty, etc) 
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Appendix 3 
Graph of survey weight and 1/Standard Error 
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Appendix 4 
Full description of Census data 


Variables Description E heoretical basis for inclusion in the 
odel 


ictim variables (personal crime) from 1998 NCSS (X Matrix) 
ale aged 15 - [Male victims aged 15 - 25/all persons [Lifestyle theory: Males aged 15 to 25 


2) ged 15 and over, used for Assault e more likely to place themselves in 

nly Situations where they are more likely 
o become victims of crime. 

emale emale victims/all persons aged 15 emales are more likely to be victims 

nd over, used for Assault only f personal crime, particularly single 
others. 

ivorced/separDivorced or separated victims/all ocial disorganisation theory - 

ted ersons aged 15 and over, used for ivorced or separated persons are 
ssault and Sexual Assault only ore likely to be victims of crime. 


ocio-Demographic characteristics from 1996 Census (Y Matrix) 

edian weekly household income. ncome, in particular low income, and 
his is the income for a household, elative differences in income are 
hich may not contain a family (eg, ssociated with the Economic, Strain 
roup households). nd Social disorganisation theories of 
owest quartile of household income. (crime. 

kewness measure of the lack of 
Symmetry around the mean. 
edian weekly individual income. s above 
owest quartile of individual income. 

kewness measure of the lack of 

Symmetry around the mean. 

edian weekly family income. This is|As above 
he income for a family unit in a 
ouse. 

owest quartile of family income. 
kewness measure of the lack of 
Symmetry around the mean. 
roportion of families in the area 
ithin the lowest quartile of total 
income from the survey. 
ndigenous persons as a % of the 
opulation 


amily income 


ocial disorganisation theory - Low 
income areas are generally associated 
ith more crime. 

ocial disorganisation theory - Greater 
he heterogeneity of a given 

opulation, less social bonds formed. 
ub-culture theory - Variable included 
o test the hypothesis that cultural 
inorities are associated with higher 
rime. 


ustralian ustralian born persons of Australian See note about indigenous persons. 
or parents as a % of the population 
Low fluency in Persons with no or poor English skills Bee note about indigenous persons. 


English las a % of population | 
nemploymen |Unemployed persons as a % of the 
otal population aged 15 and over 


train theory - Unemployment causes 
inancial hardship, and this is a 
otivation for crime intended to 
elieve hardship. Economic hardship 
ay also give rise to frustrations that 
increase the frequency of aggressive 
cts. 

ifestyle theory: males aged 15 to 25 
e more likely to be offenders. 


ale persons aged 15 - 25 asa % of 
he population 


ingle parent with one or more 
ependant children persons as % of all 
amilies 

ersons with no post-secondary 
ualifications as a % of the population 


ocial disorganisation theory - Family 
isruption is often a primary cause of 
conomic hardship. 

his variable is highly linked to 
nemployment, income and poverty 
ariables. It will be included initially, 
ut may be dropped if 

ulticollinearity is a problem. 


niversity ersons with post secondary s above 
egrees ualifications as a % of the population 
eographic Region characteristics from 1996 Census (Z Matrix) 
iving alone ouseholds with Person living heory of guardianship - Persons 
lone/All households iving alone are easier victims than 
roups. Household left 'unguarded' for 
onger periods. 
welling ented households / All households ocial disorganisation theory - Rented 
enure ouseholds tend to be more mobile 
orming less bonds with the 
ommunity. Rent to a much lesser 
xtent taps into the concept of 'wealth’. 
ouseholds who own their own home 
aybe seen as 'wealthier' than those 


ho don't. 
opulation umber of persons per square conomic and social disorganisation 
ensity ilometre heories - High density areas have 


elatively more opportunities for 
rime, more anonymity and less 
ommunity bonding. 


welling type [Households in separate house/All imilar to the 'density' variable - see 
1 ouseholds bove 
welling type [Households in flats, units or ee above 
partment/AlIl households 


esidential amilies who changed address during Social disorganisation theory - People 
obility 1 he last year as a % of all families ho are more mobile form less bonds 
ith the community. 


esidential amilies who changed address during |As above. 
obility 5 he last 5 years as a % of all families 


Appendix 5 
Data for the propensity to report model 


Variables a f heory 
eriousness Variables s an offence becomes more serious, 
an alae ane rates will increase. 


Whether [| Whether something was stolen | was stolen [Binary | 


Whether victim was physically inary 
attacked 
Whether victim was threatened with [Binary 
violence 


Whether a weapon was used [Binary | 


Whether [| Whether the victim was injured | victim was [| Whether the victim was injured | 


Whether there was more than | a 

offender 

Whether victim lived alone one victims are more likely to report 
security issues) 


elationship Variables (only available he closer the relationship between the 
or Assault) ffender and victim, the less likely the 
ictim is to report the offence. 


Victim was married to offender Binary 
Victim was boyfriend/girlfriend Binary 
Victim was ex-partner Binary 
Victim was ex boyfriend/girlfriend Binary 


Victim was partner or ex-partner inary ncidents occurring at home involving 
AND offence occurred at home urrent family members are less likely 
o be reported. 


rime Variables 


Number of times the offence iscrete epeat victims are less likely to report 
occurred in the last 12 months. epeat victimisations. 


ndividual Socio-Demographic ome socioeconomic groups do not 
ariables ave a good relationship with the 
olice, and are less inclined to report 
ictimisation. 


Sex inary emales are more likely to report 
ffences to the police. 

Sex * Victim knew offender inary emales are less likely to report if 
ffender known. 


EE SSS 15 - EE SSS [Binary | oung [Young people are less inclined to report] are [Young people are less inclined to report] inclined to report 


Age 65 and over Binary ae people are more inclined to 
eport 


Males aged 15 - 24 Binary Young Males are less inclined to report. 
Unemployed Binary Unemployed are less likely to report. 


ousehold Socio-Demographic 
ariables 


Whether victim was sole parent with Binary Sole parents with young children more 


| children under 15. | likely to report (security issues). | 
Whether been at current address 5 Binary que to current area is associated 
years or more. ith greater propensity to report. 
Whether own home or not Binary eons to current area is associated 
ith greater propensity to report. 


Whether have security devices i ecurity devices associated with more 
installed ttachment to house which is associated 
i 


ith higher reporting. 


inary 
Whether area has perceived inary ouseholds in criminogenic areas have 
problems with property crime, ower propensity to report crime. 
violent crime, drug crime, uestion used is the ‘Perceptions of 
drunkenness, louts/youths/gangs, ea’ question from the NCSS. 

men 


prowlers/loiterers, vandalism, 
dangerous noisy driving, etc. 


Whether household in high risk area |Bin Households in these areas have a lower 
(100 metres from a school, railway propensity to report crime. These data 


station, pub, hotel, etc. 


ome from questions in the NCSS. 
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